Classification of Hadiths using LVQ based on VSM Considering Words Order

Authors

  • Mohamed Ghanem
  • Abdelaaziz Mouloudi
  • Mohammed Mourchid
Abstract

Islam is based on a sacred text, the Qur’an, a divine speech expressed in Arabic. The Qur’an is the main root of Islamic jurisprudence, which has a second source of inspiration known as the Hadiths. As Muslims’ lives are governed by these holy texts, their authenticity must be established. Using the VSM (Vector Space Model), we can represent a Hadith as a vector of words. The term weighting obtained by multiplying term frequency by inverse document frequency does not take word order into account; however, the order of narrators is critical for classifying a Hadith. In this paper we propose a new method that considers word order (in our case, the order of narrators) to classify Hadiths into four categories: Sahih, Hasan, Da’if and Maudu’. For this purpose we use LVQ (Learning Vector Quantization). We obtained good results for classifying the Sahih and Maudu’ categories.

General Terms: Hadith categorization, Algorithms.
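
To make the abstract's two building blocks concrete, here is a minimal sketch in Python. It assumes plain TF-IDF weighting and the textbook LVQ1 update rule; the narrator names (A, B, C, D), the function names and the learning rate are hypothetical. The abstract does not describe the authors' order-aware weighting, so it is not reproduced here. The first part shows that two chains with the same narrators in a different order receive identical TF-IDF weights; the second part shows a single LVQ1 prototype update.

```python
import math
from collections import Counter

def tfidf(docs):
    """Return one {term: tf * idf} dict per tokenized document (plain TF-IDF)."""
    n = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    return [
        {t: count * math.log(n / df[t]) for t, count in Counter(doc).items()}
        for doc in docs
    ]

# Hypothetical narrator chains; A, B, C, D stand in for narrator names.
chain_1 = ["A", "B", "C"]
chain_2 = ["C", "B", "A"]  # same narrators, reversed order
chain_3 = ["A", "D"]

w = tfidf([chain_1, chain_2, chain_3])
# Bag-of-words weights ignore order, so the two chains are indistinguishable.
same_weights = w[0] == w[1]

def lvq1_step(prototypes, labels, x, y, lr=0.1):
    """One textbook LVQ1 update (an assumption, not the paper's exact variant).

    The prototype nearest to x moves toward x when its label equals y,
    and away from x otherwise. Returns the index of that prototype.
    """
    dists = [sum((p_i - x_i) ** 2 for p_i, x_i in zip(p, x)) for p in prototypes]
    k = dists.index(min(dists))
    sign = 1.0 if labels[k] == y else -1.0
    prototypes[k] = [
        p_i + sign * lr * (x_i - p_i) for p_i, x_i in zip(prototypes[k], x)
    ]
    return k
```

In a full classifier, each Hadith vector would be passed through `lvq1_step` repeatedly during training, and a new Hadith would be assigned the label of its nearest prototype. Any order-aware weighting would replace `tfidf` in that pipeline.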

Similar resources

Imam Sadegh’s (AS) Hadiths in Sunni’s lexicon

The Quran and the Hadiths, including the Hadiths of the Infallibles (AS) such as Imam Sadegh (AS), have long been a reference for compilation and a field of research for Arab morphologists. This research illustrates Imam Sadegh’s (AS) Hadiths based on Sunni lexicons, and then on other books of Islamic sciences, in order to identify where these Hadiths hav...

Text categorization using topic model and ontology networks

Text categorization based on pre-defined document categories is one of the most crucial tasks in text mining applications in recent decades. Successful text categorization highly relies on the text representations generated from documents. In this paper, an innovative text categorization model, VSM_WN_TM, is presented. VSM_WN_TM is a special Vector Space Model (VSM) that incorporates word frequ...

Prototype-based minimum classification error/generalized probabilistic descent training for various speech units

In previous work we reported high classification rates for Learning Vector Quantization (LVQ) networks trained to classify phoneme tokens shifted in time. It has since been shown that the framework of Minimum Classification Error (MCE) and Generalized Probabilistic Descent (GPD) can treat LVQ as a special case of a general method for gradient descent on a rigorously defined classification loss meas...

Lyric-based Song Sentiment Classification with Sentiment Vector Space Model

Lyric-based song sentiment classification seeks to assign songs appropriate sentiment labels such as light-hearted and heavy-hearted. Four problems render the vector space model (VSM)-based text classification approach ineffective: 1) Many words within song lyrics actually contribute little to sentiment; 2) Nouns and verbs used to express sentiment are ambiguous; 3) Negations and modifiers around t...

Publication date: 2016